PROTEIN SCAFFOLDS FOR ANTIBODY MIMICS
AND OTHER BINDING PROTEINS

Background of the Invention 
This invention relates to protein scaffolds useful, for example, for 
the generation of products having novel binding characteristics. 

Proteins having relatively defined three-dimensional structures,
commonly referred to as protein scaffolds, may be used as reagents for the
design of engineered products. These scaffolds typically contain one or more
regions which are amenable to specific or random sequence variation, and such
sequence randomization is often carried out to produce libraries of proteins
from which desired products may be selected. One particular area in which
such scaffolds are useful is the field of antibody design.

A number of previous approaches to the manipulation of the
mammalian immune system to obtain reagents or drugs have been attempted.
These have included injecting animals with antigens of interest to obtain
mixtures of polyclonal antibodies reactive against specific antigens, production
of monoclonal antibodies in hybridoma cell culture (Koehler and Milstein,
Nature 256:495, 1975), modification of existing monoclonal antibodies to
obtain new or optimized recognition properties, creation of novel antibody
fragments with desirable binding characteristics, and randomization of single
chain antibodies (created by connecting the variable regions of the heavy and
light chains of antibody molecules with a flexible peptide linker) followed by
selection for antigen binding by phage display (Clackson et al., Nature
352:624, 1991).

In addition, several non-immunoglobulin protein scaffolds have
been proposed for obtaining proteins with novel binding properties. For
example, a "minibody" scaffold, which is related to the immunoglobulin fold,
has been designed by deleting three beta strands from a heavy chain variable
domain of a monoclonal antibody (Tramontano et al., J. Mol. Recognit. 7:9,
1994). This protein includes 61 residues and can be used to present two
hypervariable loops. These two loops have been randomized and products
selected for antigen binding, but thus far the framework appears to have
somewhat limited utility due to solubility problems. Another framework used
to display loops has been tendamistat, a 74 residue, six-strand beta sheet
sandwich held together by two disulfide bonds (McConnell and Hoess, J. Mol.
Biol. 250:460, 1995). This scaffold includes three loops, but, to date, only two
of these loops have been examined for randomization potential.

Other proteins have been tested as frameworks and have been used
to display randomized residues on alpha helical surfaces (Nord et al., Nat.
Biotechnol. 15:772, 1997; Nord et al., Protein Eng. 8:601, 1995), loops
between alpha helices in alpha helix bundles (Ku and Schultz, Proc. Natl.
Acad. Sci. USA 92:6552, 1995), and loops constrained by disulfide bridges,
such as those of the small protease inhibitors (Markland et al., Biochemistry
35:8045, 1996; Markland et al., Biochemistry 35:8058, 1996; Rottgen and
Collins, Gene 164:243, 1995; Wang et al., J. Biol. Chem. 270:12250, 1995).

Summary of the Invention 
The present invention provides a new family of proteins capable of
evolving to bind any compound of interest. These proteins, which make use of
a fibronectin or fibronectin-like scaffold, function in a manner characteristic of
natural or engineered antibodies (that is, polyclonal, monoclonal, or
single-chain antibodies) and, in addition, possess structural advantages.
Specifically, the structure of these antibody mimics has been designed for
optimal folding, stability, and solubility, even under conditions which normally
lead to the loss of structure and function in antibodies.

These antibody mimics may be utilized for the purpose of designing
proteins which are capable of binding to virtually any compound (for example,
any protein) of interest. In particular, the fibronectin-based molecules
described herein may be used as scaffolds which are subjected to directed
evolution designed to randomize one or more of the three fibronectin loops
which are analogous to the complementarity-determining regions (CDRs) of an
antibody variable region. Such a directed evolution approach results in the
production of antibody-like molecules with high affinities for antigens of
interest. In addition, the scaffolds described herein may be used to display
defined exposed loops (for example, loops previously randomized and selected
on the basis of antigen binding) in order to direct the evolution of molecules
that bind to such introduced loops. A selection of this type may be carried out
to identify recognition molecules for any individual CDR-like loop or,
alternatively, for the recognition of two or all three CDR-like loops combined
into a non-linear epitope.

Accordingly, the present invention features a protein that includes a
fibronectin type III domain having at least one randomized loop, the protein
being characterized by its ability to bind to a compound that is not bound by
the corresponding naturally-occurring fibronectin.

In preferred embodiments, the fibronectin type III domain is a
mammalian (for example, a human) fibronectin type III domain; and the
protein includes the tenth module of the fibronectin type III (¹⁰Fn3) domain. In
such proteins, compound binding is preferably mediated by either one, two, or
three ¹⁰Fn3 loops. In other preferred embodiments, the second loop of ¹⁰Fn3
may be extended in length relative to the naturally-occurring module, or the
¹⁰Fn3 may lack an integrin-binding motif. In these molecules, the
integrin-binding motif may be replaced by an amino acid sequence in which a basic
amino acid-neutral amino acid-acidic amino acid sequence (in the N-terminal
to C-terminal direction) replaces the integrin-binding motif; one preferred
sequence is serine-glycine-glutamate. In another preferred embodiment, the
fibronectin type III domain-containing proteins of the invention lack disulfide
bonds.

Any of the fibronectin type III domain-containing proteins described
herein may be formulated as part of a fusion protein (for example, a fusion
protein which further includes an immunoglobulin domain, a complement
protein, a toxin protein, or an albumin protein). In addition, any of the
fibronectin type III domain proteins may be covalently bound to a nucleic acid
(for example, an RNA), and the nucleic acid may encode the protein.
Moreover, the protein may be a multimer, or, particularly if it lacks an
integrin-binding motif, it may be formulated in a physiologically-acceptable carrier.

The present invention also features proteins that include a
fibronectin type III domain having at least one mutation in a β-sheet sequence
which changes the scaffold structure. Again, these proteins are characterized
by their ability to bind to compounds that are not bound by the corresponding
naturally-occurring fibronectin.

In addition, any of the fibronectin scaffolds of the invention may be
immobilized on a solid support (for example, a bead or chip), and these
scaffolds may be arranged in any configuration on the solid support, including
an array.

In a related aspect, the invention further features nucleic acids
encoding any of the proteins of the invention. In preferred embodiments, the
nucleic acid is DNA or RNA.

In another related aspect, the invention also features a method for
generating a protein which includes a fibronectin type III domain and which is
pharmaceutically acceptable to a mammal, involving removing the
integrin-binding domain of said fibronectin type III domain. This method may be
applied to any of the fibronectin type III domain-containing proteins described
above and is particularly useful for generating proteins for human therapeutic
applications. The invention also features such fibronectin type III
domain-containing proteins which lack integrin-binding domains.

In yet other related aspects, the invention features screening methods
which may be used to obtain or evolve randomized fibronectin type III proteins
capable of binding to compounds of interest, or to obtain or evolve compounds
(for example, proteins) capable of binding to a particular protein containing a
randomized fibronectin type III motif. In addition, the invention features
screening procedures which combine these two methods, in any order, to
obtain either compounds or proteins of interest.

In particular, the first screening method, useful for the isolation or
identification of randomized proteins of interest, involves: (a) contacting the
compound with a candidate protein, the candidate protein including a
fibronectin type III domain having at least one randomized loop, the contacting
being carried out under conditions that allow compound-protein complex
formation; and (b) obtaining, from the complex, the protein which binds to the
compound.

The second screening method, for isolating or identifying a
compound which binds to a protein having a randomized fibronectin type III
domain, involves: (a) contacting the protein with a candidate compound, the
contacting being carried out under conditions that allow compound-protein
complex formation; and (b) obtaining, from the complex, the compound which
binds to the protein.

In preferred embodiments, the methods further involve either
randomizing at least one loop of the fibronectin type III domain of the protein
obtained in step (b) and repeating steps (a) and (b) using the further
randomized protein, or modifying the compound obtained in step (b) and
repeating steps (a) and (b) using the further modified compound. In addition,
the compound is preferably a protein, and the fibronectin type III domain is
preferably a mammalian (for example, a human) fibronectin type III domain.
In other preferred embodiments, the protein includes the tenth module of the
fibronectin type III domain (¹⁰Fn3), and binding is mediated by one, two, or
three ¹⁰Fn3 loops. In addition, the second loop of ¹⁰Fn3 may be extended in
length relative to the naturally-occurring module, or ¹⁰Fn3 may lack an
integrin-binding motif. Again, as described above, the integrin-binding motif
may be replaced by an amino acid sequence in which a basic amino
acid-neutral amino acid-acidic amino acid sequence (in the N-terminal to C-terminal
direction) replaces the integrin-binding motif; one preferred sequence is
serine-glycine-glutamate.
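
The iterative scheme described above (contact, recover the binders, further randomize, repeat) can be illustrated in silico. The following is only a toy sketch in Python, not the laboratory procedure: the template string, the 0-based loop indices, the pool sizes, and the scoring function passed as `binds` are all hypothetical stand-ins chosen for illustration.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def randomize_loop(sequence, loop_positions, rng, n_changes=1):
    """Return a copy of `sequence` with `n_changes` loop positions re-drawn."""
    residues = list(sequence)
    for pos in rng.sample(loop_positions, n_changes):
        residues[pos] = rng.choice(AMINO_ACIDS)
    return "".join(residues)


def select_binders(pool, binds, keep):
    """Steps (a) and (b): score every candidate and keep the best `keep`."""
    return sorted(pool, key=binds, reverse=True)[:keep]


def evolve(template, loop_positions, binds,
           rounds=5, pool_size=200, keep=20, seed=0):
    """Repeat selection and further randomization for a fixed number of rounds."""
    rng = random.Random(seed)
    # Initial library: every loop position is randomized, framework untouched.
    pool = [randomize_loop(template, loop_positions, rng,
                           n_changes=len(loop_positions))
            for _ in range(pool_size)]
    for _ in range(rounds):
        survivors = select_binders(pool, binds, keep)
        # Keep the survivors and refill the pool with further-randomized copies.
        pool = survivors + [
            randomize_loop(rng.choice(survivors), loop_positions, rng)
            for _ in range(pool_size - keep)
        ]
    return select_binders(pool, binds, keep)


if __name__ == "__main__":
    template = "G" * 30                  # hypothetical framework
    loop = list(range(10, 16))           # hypothetical loop, 0-based indices
    toy_score = lambda s: sum(s[i] == "W" for i in loop)  # hypothetical "binding"
    print(evolve(template, loop, toy_score)[0])
```

Only loop positions are ever altered, so the framework residues of every recovered sequence match the template; the scoring function is the one place where a real binding measurement would enter.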

The selection methods described herein may be carried out using any
fibronectin type III domain-containing protein. For example, the fibronectin
type III domain-containing protein may lack disulfide bonds, or may be
formulated as part of a fusion protein (for example, a fusion protein which
further includes an immunoglobulin Fc domain, a complement protein, a toxin
protein, or an albumin protein). In addition, selections may be carried out
using the fibronectin type III domain proteins covalently bound to nucleic
acids (for example, RNAs or any nucleic acid which encodes the protein).
Moreover, the selections may be carried out using fibronectin
domain-containing protein multimers.

Preferably, the selections involve the immobilization of the binding
target on a solid support. Preferred solid supports include columns (for
example, affinity columns, such as agarose columns) or microchips.

In addition, the invention features diagnostic methods which employ
the fibronectin scaffold proteins of the invention. Such diagnostic methods
may be carried out on a sample (for example, a biological sample) to detect one
analyte or to simultaneously detect many different analytes in the sample. The
method may employ any of the scaffold molecules described herein.
Preferably, the method involves (a) contacting the sample with a protein which
binds to the compound analyte and which includes a fibronectin type III
domain having at least one randomized loop, the contacting being carried out
under conditions that allow compound-protein complex formation; and (b)
detecting the complex, and therefore the compound in the sample.

In preferred embodiments, the protein is immobilized on a solid
support (for example, a chip or bead) and may be immobilized as part of an
array. The protein may be covalently bound to a nucleic acid, preferably, a
nucleic acid, such as RNA, that encodes the protein. In addition, the
compound is often a protein, but may also be any other analyte in a sample.
Detection may be accomplished by any standard technique including, without
limitation, radiography, fluorescence detection, mass spectroscopy, or surface
plasmon resonance.

As used herein, by "fibronectin type III domain" is meant a domain
having 7 or 8 beta strands which are distributed between two beta sheets,
which themselves pack against each other to form the core of the protein, and
further containing loops which connect the beta strands to each other and are
solvent exposed. There are at least three such loops at each edge of the beta
sheet sandwich, where the edge is the boundary of the protein perpendicular to
the direction of the beta strands. Preferably, a fibronectin type III domain
includes a sequence which exhibits at least 30% amino acid identity, and
preferably at least 50% amino acid identity, to the sequence encoding the
structure of the ¹⁰Fn3 domain referred to as "1ttg" (ID = "1ttg" (one ttg))
available from the Protein Data Base. Sequence identity referred to in this
definition is determined by the Homology program, available from Molecular
Simulation (San Diego, CA). The invention further includes polymers of
¹⁰Fn3-related molecules, which are an extension of the use of the monomer
structure, whether or not the subunits of the polyprotein are identical or
different in sequence.
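
The 30% and 50% identity thresholds in this definition can be made concrete with a small calculation. The Python sketch below is an illustration only: it assumes the two sequences have already been aligned (with "-" marking gaps) and counts identity over aligned, ungapped columns. That counting convention is an assumption; it is not a reimplementation of the Homology program named above, and the function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of ungapped alignment columns in which both residues are identical."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # columns with a gap in either sequence are not counted
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def meets_identity_threshold(aligned_a, aligned_b, threshold=30.0):
    """True if identity reaches the threshold (30% by default, 50% preferred)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Other conventions (for example, dividing by the length of the shorter sequence) give different percentages, so the convention should be fixed before comparing a candidate domain against a threshold.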

By "naturally occurring fibronectin" is meant any fibronectin protein 
that is encoded by a living organism. 

By "randomized" is meant including one or more amino acid
alterations relative to a template sequence.

By a "protein" is meant any sequence of two or more amino acids,
regardless of length, post-translation modification, or function. "Protein" and
"peptide" are used interchangeably herein.

By "RNA" is meant a sequence of two or more covalently bonded,
naturally occurring or modified ribonucleotides. One example of a modified
RNA included within this term is phosphorothioate RNA.

By "DNA" is meant a sequence of two or more covalently bonded,
naturally occurring or modified deoxyribonucleotides.

By a "nucleic acid" is meant any two or more covalently bonded
nucleotides or nucleotide analogs or derivatives. As used herein, this term
includes, without limitation, DNA, RNA, and PNA.

By "pharmaceutically acceptable" is meant a compound or protein 
that may be administered to an animal (for example, a mammal) without 
significant adverse medical consequences. 

By "physiologically acceptable carrier" is meant a carrier which does
not have a significant detrimental impact on the treated host and which retains
the therapeutic properties of the compound with which it is administered. One
exemplary physiologically acceptable carrier is physiological saline. Other
physiologically acceptable carriers and their formulations are known to one
skilled in the art and are described, for example, in Remington's
Pharmaceutical Sciences (18th edition), ed. A. Gennaro, 1990, Mack
Publishing Company, Easton, PA, incorporated herein by reference.

By "selecting" is meant substantially partitioning a molecule from
other molecules in a population. As used herein, a "selecting" step provides at
least a 2-fold, preferably, a 30-fold, more preferably, a 100-fold, and, most
preferably, a 1000-fold enrichment of a desired molecule relative to undesired
molecules in a population following the selection step. A selection step may
be repeated any number of times, and different types of selection steps may be
combined in a given approach.
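
The fold-enrichment figures in this definition can be computed from molecule counts before and after a selection step. The Python sketch below assumes that enrichment is measured as the change in the ratio of desired to undesired molecules; the function names and the example counts are hypothetical.

```python
def fold_enrichment(desired_before, undesired_before,
                    desired_after, undesired_after):
    """Ratio of desired:undesired after the step divided by the ratio before it."""
    ratio_before = desired_before / undesired_before
    ratio_after = desired_after / undesired_after
    return ratio_after / ratio_before


def is_selecting_step(enrichment, minimum=2.0):
    """True if the step meets the stated minimum (at least 2-fold by default)."""
    return enrichment >= minimum


if __name__ == "__main__":
    # Hypothetical counts: 1 desired per 4 undesired before, 8 per 4 after.
    e = fold_enrichment(1, 4, 8, 4)
    print(e, is_selecting_step(e), is_selecting_step(e, minimum=30.0))
```

Because enrichment is multiplicative under this convention, repeating a step compounds it: two successive 30-fold steps would give roughly 900-fold overall, if each step performs independently.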

By "binding partner," as used herein, is meant any molecule which
has a specific, covalent or non-covalent affinity for a portion of a desired
compound (for example, protein) of interest. Examples of binding partners
include, without limitation, members of antigen/antibody pairs,
protein/inhibitor pairs, receptor/ligand pairs (for example, cell surface
receptor/ligand pairs, such as hormone receptor/peptide hormone pairs),
enzyme/substrate pairs (for example, kinase/substrate pairs),
lectin/carbohydrate pairs, oligomeric or heterooligomeric protein aggregates,
DNA binding protein/DNA binding site pairs, RNA/protein pairs, and nucleic
acid duplexes, heteroduplexes, or ligated strands, as well as any molecule
which is capable of forming one or more covalent or non-covalent bonds (for
example, disulfide bonds) with any portion of another molecule (for example, a
compound or protein).

By a "solid support" is meant, without limitation, any column (or
column material), bead, test tube, microtiter dish, solid particle (for example,
agarose or sepharose), microchip (for example, silicon, silicon-glass, or gold
chip), or membrane (for example, the membrane of a liposome or vesicle) to
which a fibronectin scaffold or an affinity complex may be bound, either
directly or indirectly (for example, through other binding partner intermediates
such as other antibodies or Protein A), or in which a fibronectin scaffold or an
affinity complex may be embedded (for example, through a receptor or
channel).

The present invention provides a number of advantages. For
example, as described in more detail below, the present antibody mimics
exhibit improved biophysical properties, such as stability under reducing
conditions and solubility at high concentrations. In addition, these molecules
may be readily expressed and folded in prokaryotic systems, such as E. coli, in
eukaryotic systems, such as yeast, and in in vitro translation systems, such as
the rabbit reticulocyte lysate system. Moreover, these molecules are extremely
amenable to affinity maturation techniques involving multiple cycles of
selection, including in vitro selection using RNA-protein fusion technology
(Roberts and Szostak, Proc. Natl. Acad. Sci. USA 94:12297, 1997; Szostak et
al., U.S.S.N. 09/007,005 and U.S.S.N. 09/247,190; Szostak et al.,
WO98/31700), phage display (see, for example, Smith and Petrenko, Chem.
Rev. 97:317, 1997), and yeast display systems (see, for example, Boder and
Wittrup, Nature Biotech. 15:553, 1997).

Other features and advantages of the present invention will be 
apparent from the following detailed description thereof, and from the claims. 

Brief Description of the Drawings 
FIGURE 1 is a photograph showing a comparison between the
structures of antibody heavy chain variable regions from camel (dark blue) and
llama (light blue), in each of two orientations.

FIGURE 2 is a photograph showing a comparison between the
structures of the camel antibody heavy chain variable region (dark blue), the
llama antibody heavy chain variable region (light blue), and a fibronectin type
III module number 10 (¹⁰Fn3) (yellow).

FIGURE 3 is a photograph showing a fibronectin type III module
number 10 (¹⁰Fn3), with the loops corresponding to the antigen-binding loops
in IgG heavy chains highlighted in red.

FIGURE 4 is a graph illustrating a sequence alignment between a
fibronectin type III protein domain and related protein domains.

FIGURE 5 is a photograph showing the structural similarities
between a ¹⁰Fn3 domain and 15 related proteins, including fibronectins,
tenascins, collagens, and undulin. In this photograph, the regions are labeled
as follows: constant, dark blue; conserved, light blue; neutral, white; variable,
red; and RGD integrin-binding motif (variable), yellow.

FIGURE 6 is a photograph showing space filling models of
fibronectin III modules 9 and 10, in each of two different orientations. The
two modules and the integrin binding loop (RGD) are labeled. In this figure,
blue indicates positively charged residues, red indicates negatively charged
residues, and white indicates uncharged residues.

FIGURE 7 is a photograph showing space filling models of
fibronectin III modules 7-10, in each of three different orientations. The four
modules are labeled. In this figure, blue indicates positively charged residues,
red indicates negatively charged residues, and white indicates uncharged
residues.

FIGURE 8 is a photograph illustrating the formation, under different
salt conditions, of RNA-protein fusions which include fibronectin type III
domains.

FIGURE 9 is a series of photographs illustrating the selection of
fibronectin type III domain-containing RNA-protein fusions, as measured by
PCR signal analysis.

FIGURE 10 is a graph illustrating an increase in the percent TNF-α
binding during the selections described herein, as well as a comparison
between RNA-protein fusion and free protein selections.

FIGURE 11 is a series of schematic representations showing IgG,
¹⁰Fn3, Fn-CH1-CH2-CH3, and Fn-CH2-CH3 (clockwise from top left).

FIGURE 12 is a photograph showing a molecular model of
Fn-CH1-CH2-CH3 based on known three-dimensional structures of IgG (X-ray
crystallography) and ¹⁰Fn3 (NMR and X-ray crystallography).

FIGURE 13 is a graph showing the time course of an exemplary
¹⁰Fn3-based nucleic acid-protein fusion selection of TNF-α binders. The
proportion of nucleic acid-protein fusion pool (open diamonds) and free
protein pool (open circles) that bound to TNF-α-Sepharose, and the proportion
of free protein pool (full circles) that bound to underivatized Sepharose, are
shown.

FIGURES 14 and 15 are graphs illustrating TNF-α binding by
TNF-α Fn-binders. In particular, these figures show mass spectra data obtained
from a ¹⁰Fn3 fusion chip and non-fusion chip, respectively.

FIGURES 16 and 17 are the phosphorimage and fluorescence scan,
respectively, of a ¹⁰Fn3 array, illustrating TNF-α binding.

Detailed Description 

The novel antibody mimics described herein have been designed to
be superior both to antibody-derived fragments and to non-antibody
frameworks, for example, those frameworks described above.

The major advantage of these antibody mimics over antibody
fragments is structural. These scaffolds are derived from whole, stable, and
soluble structural modules found in human body fluid proteins. Consequently,
they exhibit better folding and thermostability properties than antibody
fragments, whose creation involves the removal of parts of the antibody native
fold, often exposing amino acid residues that, in an intact antibody, would be
buried in a hydrophobic environment, such as an interface between variable
and constant domains. Exposure of such hydrophobic residues to solvent
increases the likelihood of aggregation.

In addition, the antibody mimics described herein have no disulfide
bonds, which have been reported to retard or prevent proper folding of
antibody fragments under certain conditions. Since the present scaffolds do
not rely on disulfides for native fold stability, they are stable under reducing
conditions, unlike antibodies and their fragments which unravel upon disulfide
bond breakdown.

WO 01/64942    PCT/US01/06414

Moreover, these fibronectin-based scaffolds provide the functional
advantages of antibody molecules. In particular, despite the fact that the ¹⁰Fn3
module is not an immunoglobulin, its overall fold is close to that of the
variable region of the IgG heavy chain (Figure 2), making it possible to display
the three fibronectin loops analogous to CDRs in relative orientations similar
to those of native antibodies. Because of this structure, the present antibody
mimics possess antigen binding properties that are similar in nature and affinity
to those of antibodies, and a loop randomization and shuffling strategy may be
employed in vitro that is similar to the process of affinity maturation of
antibodies in vivo.

There are now described below exemplary fibronectin-based 
scaffolds and their use for identifying, selecting, and evolving novel binding 
proteins as well as their target ligands. These examples are provided for the 
purpose of illustrating, and not limiting, the invention. 

¹⁰Fn3 Structural Motif

The antibody mimics of the present invention are based on the
structure of a fibronectin module of type III (Fn3), a common domain found in
mammalian blood and structural proteins. This domain occurs more than 400
times in the protein sequence database and has been estimated to occur in 2%
of the proteins sequenced to date, including fibronectins, tenascin, intracellular
cytoskeletal proteins, and prokaryotic enzymes (Bork and Doolittle, Proc. Natl.
Acad. Sci. USA 89:8990, 1992; Bork et al., Nature Biotech. 15:553, 1997;
Meinke et al., J. Bacteriol. 175:1910, 1993; Watanabe et al., J. Biol. Chem.
265:15659, 1990). In particular, these scaffolds include, as templates, the
tenth module of human Fn3 (¹⁰Fn3), which comprises 94 amino acid residues.
The overall fold of this domain is closely related to that of the smallest
functional antibody fragment, the variable region of the heavy chain, which
comprises the entire antigen recognition unit in camel and llama IgG (Figures 1
and 2). The major differences between camel and llama domains and the ¹⁰Fn3
domain are that (i) ¹⁰Fn3 has fewer beta strands (seven vs. nine) and (ii) the
two beta sheets packed against each other are connected by a disulfide bridge
in the camel and llama domains, but not in ¹⁰Fn3.

The three loops of ¹⁰Fn3 corresponding to the antigen-binding loops
of the IgG heavy chain run between amino acid residues 21-31, 51-56, and
76-88 (Figure 3). The lengths of the first and the third loops, 11 and 12 residues,
respectively, fall within the range of the corresponding antigen-recognition
loops found in antibody heavy chains, that is, 10-12 and 3-25 residues,
respectively. Accordingly, once randomized and selected for high antigen
affinity, these two loops make contacts with antigens equivalent to the contacts
of the corresponding loops in antibodies.

In contrast, the second loop of ¹⁰Fn3 is only 6 residues long, whereas
the corresponding loop in antibody heavy chains ranges from 16-19 residues.
To optimize antigen binding, therefore, the second loop of ¹⁰Fn3 is preferably
extended by 10-13 residues (in addition to being randomized) to obtain the
greatest possible flexibility and affinity in antigen binding. Indeed, in general,
the lengths as well as the sequences of the CDR-like loops of the antibody
mimics may be randomized during in vitro or in vivo affinity maturation (as
described in more detail below).
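
The arithmetic behind the preferred 10-13 residue extension follows directly from the figures in the paragraph above. The short Python sketch below assumes inclusive, 1-based residue numbering for the second loop (residues 51-56); the helper names are hypothetical.

```python
def loop_length(first, last):
    """Number of residues in a loop given inclusive 1-based boundaries."""
    return last - first + 1


def extension_range(current_length, target_min, target_max):
    """Residues to add so the loop length falls within the target range."""
    return target_min - current_length, target_max - current_length


second_loop = loop_length(51, 56)                   # second CDR-like loop
needed = extension_range(second_loop, 16, 19)       # antibody range from the text

if __name__ == "__main__":
    print(second_loop, needed)
```

With a 6-residue loop and a 16-19 residue target, the required extension is 10 to 13 residues, matching the preferred range stated in the text.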

The tenth human fibronectin type III domain, ¹⁰Fn3, refolds rapidly
even at low temperature; its backbone conformation has been recovered within
1 second at 5°C. Thermodynamic stability of ¹⁰Fn3 is high (ΔGu = 24 kJ/mol =
5.7 kcal/mol), correlating with its high melting temperature of 110°C.
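
The two stability values quoted above are the same quantity in different units. As a quick check, the Python sketch below converts 24 kJ/mol to kcal/mol, assuming the thermochemical calorie (1 kcal = 4.184 kJ); the constant and function names are chosen here for illustration.

```python
KJ_PER_KCAL = 4.184  # thermochemical calorie, assumed conversion factor


def kj_to_kcal(kj_per_mol):
    """Convert an energy in kJ/mol to kcal/mol."""
    return kj_per_mol / KJ_PER_KCAL


if __name__ == "__main__":
    print(round(kj_to_kcal(24.0), 1))  # rounds to the 5.7 kcal/mol given in the text
```
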

One of the physiological roles of ¹⁰Fn3 is as a subunit of fibronectin,
a glycoprotein that exists in a soluble form in body fluids and in an insoluble
form in the extracellular matrix (Dickinson et al., J. Mol. Biol. 236:1079,
1994). A fibronectin monomer of 220-250 kD contains 12 type I modules, two
type II modules, and 17 fibronectin type III modules (Potts and Campbell,
Curr. Opin. Cell Biol. 6:648, 1994). Different type III modules are involved in
the binding of fibronectin to integrins, heparin, and chondroitin sulfate. ¹⁰Fn3
was found to mediate cell adhesion through an integrin-binding Arg-Gly-Asp
(RGD) motif on one of its exposed loops. Similar RGD motifs have been
shown to be involved in integrin binding by other proteins, such as fibrinogen,
von Willebrand factor, and vitronectin (Hynes et al., Cell 69:11, 1992). No
other matrix- or cell-binding roles have been described for ¹⁰Fn3.

The observation that ¹⁰Fn3 has only slightly more adhesive activity
than a short peptide containing RGD is consistent with the conclusion that the
cell-binding activity of ¹⁰Fn3 is localized in the RGD peptide rather than
distributed throughout the ¹⁰Fn3 structure (Baron et al., Biochemistry 31:2068,
1992). The fact that ¹⁰Fn3 without the RGD motif is unlikely to bind to other
plasma proteins or extracellular matrix makes ¹⁰Fn3 a useful scaffold to replace
antibodies. In addition, the presence of ¹⁰Fn3 in natural fibronectin in the
bloodstream suggests that ¹⁰Fn3 itself is unlikely to be immunogenic in the
organism of origin.

In addition, we have determined that the ¹⁰Fn3 framework possesses
exposed loop sequences tolerant of randomization, facilitating the generation
of diverse pools of antibody mimics. This determination was made by
examining the flexibility of the ¹⁰Fn3 sequence. In particular, the human ¹⁰Fn3
sequence was aligned with the sequences of fibronectins from other sources as
well as sequences of related proteins (Figure 4), and the results of this
alignment were mapped onto the three-dimensional structure of the human
¹⁰Fn3 domain (Figure 5). This alignment revealed that the majority of
conserved residues are found in the core of the beta sheet sandwich, whereas
the highly variable residues are located along the edges of the beta sheets,
including the N- and C-termini, on the solvent-accessible faces of both beta
sheets, and on three solvent-accessible loops that serve as the hypervariable
loops for affinity maturation of the antibody mimics. In view of these results,
the randomization of these three loops is unlikely to have an adverse effect on
the overall fold or stability of the ¹⁰Fn3 framework itself.

For the human ¹⁰Fn3 sequence, this analysis indicates that, at a
minimum, amino acids 1-9, 44-50, 61-54, 82-94 (edges of beta sheets); 19, 21,
30-46 (even), 79-65 (odd) (solvent-accessible faces of both beta sheets); 21-31,
51-56, 76-88 (CDR-like solvent-accessible loops); and 14-16 and 36-45 (other
solvent-accessible loops and beta turns) may be randomized to evolve new or
improved compound-binding proteins. In addition, as discussed above,
alterations in the lengths of one or more solvent exposed loops may also be
included in such directed evolution methods. Alternatively, changes in the
β-sheet sequences may also be used to evolve new proteins. These mutations
change the scaffold and thereby indirectly alter loop structure(s). If this
approach is taken, mutations should not saturate the sequence, but rather few
mutations should be introduced. Preferably, no more than 10 amino acid
changes, and, more preferably, no more than 3 amino acid changes should be
introduced to the β-sheet sequences by this approach.
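
The position bookkeeping in the paragraph above, and the cap on β-sheet changes, can be sketched as follows. This Python example is illustrative only: it uses the CDR-like loop ranges given in the text with inclusive 1-based numbering, while the set of β-sheet positions is left as a caller-supplied, hypothetical parameter because the text does not enumerate it. The all-alanine template in the usage lines is a placeholder, not the ¹⁰Fn3 sequence.

```python
CDR_LIKE_LOOPS = [(21, 31), (51, 56), (76, 88)]  # 1-based, inclusive, from the text


def positions(ranges):
    """Expand inclusive (first, last) ranges into a sorted list of positions."""
    return sorted({p for first, last in ranges for p in range(first, last + 1)})


def changed_positions(template, variant):
    """1-based positions at which `variant` differs from `template`."""
    if len(template) != len(variant):
        raise ValueError("sequences must have equal length")
    return [i + 1 for i, (a, b) in enumerate(zip(template, variant)) if a != b]


def sheet_mutations_ok(template, variant, sheet_positions, limit=10):
    """True if no more than `limit` changes fall on the given sheet positions."""
    sheet = set(sheet_positions)
    n_changes = sum(1 for p in changed_positions(template, variant) if p in sheet)
    return n_changes <= limit


if __name__ == "__main__":
    template = "A" * 94                            # placeholder 94-residue template
    variant = template[:4] + "WY" + template[6:]   # changes at positions 5 and 6
    hypothetical_sheet = [5, 6, 7]                 # hypothetical beta-strand positions
    print(changed_positions(template, variant))
    print(sheet_mutations_ok(template, variant, hypothetical_sheet, limit=3))
```

Setting `limit=3` corresponds to the more preferred ceiling stated in the text; the default of 10 corresponds to the preferred ceiling.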

Fibronectin Fusions 

The antibody mimics described herein may be fused to other protein
domains. For example, these mimics may be integrated with the human
immune response by fusing the constant region of an IgG (Fc) with a ¹⁰Fn3
module, preferably through the C-terminus of ¹⁰Fn3. The Fc in such a ¹⁰Fn3-Fc
fusion molecule activates the complement component of the immune response
and increases the therapeutic value of the antibody mimic. Similarly, a fusion
between ¹⁰Fn3 and a complement protein, such as C1q, may be used to target
cells, and a fusion between ¹⁰Fn3 and a toxin may be used to specifically
destroy cells that carry a particular antigen. In addition, ¹⁰Fn3 in any form may
be fused with albumin to increase its half-life in the bloodstream and its tissue
penetration. Any of these fusions may be generated by standard techniques,
for example, by expression of the fusion protein from a recombinant fusion
gene constructed using publicly available gene sequences.



Fibronectin Scaffold Multimers 

In addition to fibronectin monomers, any of the fibronectin
constructs described herein may be generated as dimers or multimers of
¹⁰Fn3-based antibody mimics as a means to increase the valency and thus the
avidity of antigen binding. Such multimers may be generated through
covalent binding between individual ¹⁰Fn3 modules, for example, by imitating
the natural ⁸Fn3-⁹Fn3-¹⁰Fn3 C-to-N-terminus binding or by imitating antibody
dimers that are held together through their constant regions. A ¹⁰Fn3-Fc
construct may be exploited to design dimers of the general scheme of
¹⁰Fn3-Fc::Fc-¹⁰Fn3. The bonds engineered into the Fc::Fc interface may be
covalent or non-covalent. In addition, dimerizing or multimerizing partners
other than Fc can be used in ¹⁰Fn3 hybrids to create such higher order
structures.

In particular examples, covalently bonded multimers may be
generated by constructing fusion genes that encode the multimer or,
alternatively, by engineering codons for cysteine residues into monomer
sequences and allowing disulfide bond formation to occur between the
expression products. Non-covalently bonded multimers may also be generated
by a variety of techniques. These include the introduction, into monomer
sequences, of codons corresponding to positively and/or negatively charged
residues and allowing interactions between these residues in the expression
products (and therefore between the monomers) to occur. This approach may
be simplified by taking advantage of charged residues naturally present in a
monomer subunit, for example, the negatively charged residues of fibronectin.
Another means for generating non-covalently bonded antibody mimics is to
introduce, into the monomer gene (for example, at the amino- or carboxy-
termini), the coding sequences for proteins or protein domains known to
interact. Such proteins or protein domains include coiled-coil motifs, leucine
zipper motifs, and any of the numerous protein subunits (or fragments thereof)
known to direct formation of dimers or higher order multimers.

Fibronectin-Like Molecules 

Although ¹⁰Fn3 represents a preferred scaffold for the generation of
antibody mimics, other molecules may be substituted for ¹⁰Fn3 in the
molecules described herein. These include, without limitation, human
fibronectin modules ¹Fn3-⁹Fn3 and ¹¹Fn3-¹⁷Fn3 as well as related Fn3 modules
from non-human animals and prokaryotes. In addition, Fn3 modules from
other proteins with sequence homology to ¹⁰Fn3, such as tenascins and
undulins, may also be used. Modules from different organisms and parent
proteins may be most appropriate for different applications; for example, in
designing an antibody mimic, it may be most desirable to generate that protein
from a fibronectin or fibronectin-like molecule native to the organism for
which a therapeutic or diagnostic molecule is intended.

Directed Evolution of Scaffold-Based Binding Proteins

The antibody mimics described herein may be used in any technique
for evolving new or improved binding proteins. In one particular example, the
target of binding is immobilized on a solid support, such as a column resin or
microtiter plate well, and the target is contacted with a library of candidate
scaffold-based binding proteins. Such a library may consist of ¹⁰Fn3 clones
constructed from the wild type ¹⁰Fn3 scaffold through randomization of the
sequence and/or the length of the ¹⁰Fn3 CDR-like loops. If desired, this library
may be an RNA-protein fusion library generated, for example, by the
techniques described in Szostak et al., U.S.S.N. 09/007,005 and 09/247,190;
Szostak et al., WO98/31700; and Roberts & Szostak, Proc. Natl. Acad. Sci.
USA (1997) vol. 94, p. 12297-12302. Alternatively, it may be a DNA-protein
library (for example, as described in Lohse, DNA-Protein Fusions and Uses
Thereof, U.S.S.N. 60/110,549, U.S.S.N. 09/459,190, and US 99/28472). The
fusion library is incubated with the immobilized target, the support is washed
to remove non-specific binders, and the tightest binders are eluted under very
stringent conditions and subjected to PCR to recover the sequence information
or to create a new library of binders which may be used to repeat the selection
process, with or without further mutagenesis of the sequence. A number of
rounds of selection may be performed until binders of sufficient affinity for the
antigen are obtained.



WO 01/64942    PCT/US01/06414



In one particular example, the ¹⁰Fn3 scaffold may be used as the
selection target. For example, if a protein is required that binds a specific
peptide sequence presented in a ten residue loop, a single ¹⁰Fn3 clone is
constructed in which one of its loops has been set to the length of ten and to
the desired sequence. The new clone is expressed in vivo and purified, and
then immobilized on a solid support. An RNA-protein fusion library based on
an appropriate scaffold is then allowed to interact with the support, which is
then washed, and desired molecules eluted and re-selected as described above.

Similarly, the ¹⁰Fn3 scaffold may be used to find natural proteins
that interact with the peptide sequence displayed in a ¹⁰Fn3 loop. The ¹⁰Fn3
protein is immobilized as described above, and an RNA-protein fusion library
is screened for binders to the displayed loop. The binders are enriched through
multiple rounds of selection and identified by DNA sequencing.

In addition, in the above approaches, although RNA-protein libraries
represent exemplary libraries for directed evolution, any type of scaffold-based
library may be used in the selection methods of the invention.

Use 

The antibody mimics described herein may be evolved to bind any
antigen of interest. These proteins have thermodynamic properties superior to
those of natural antibodies and can be evolved rapidly in vitro. Accordingly,
these antibody mimics may be employed in place of antibodies in all areas in
which antibodies are used, including in the research, therapeutic, and
diagnostic fields. In addition, because these scaffolds possess solubility and
stability properties superior to antibodies, the antibody mimics described
herein may also be used under conditions which would destroy or inactivate
antibody molecules. Finally, because the scaffolds of the present invention
may be evolved to bind virtually any compound, these molecules provide
completely novel binding proteins which also find use in the research,
diagnostic, and therapeutic areas.

Experimental Results 

Exemplary scaffold molecules described above were generated and
tested, for example, in selection protocols, as follows.

Library construction 

A complex library was constructed from three fragments, each of
which contained one randomized area corresponding to a CDR-like loop. The
fragments were named BC, DE, and FG, based on the names of the
CDR-H-like loops contained within them; in addition to ¹⁰Fn3 and a
randomized sequence, each of the fragments contained stretches encoding an
N-terminal His6 domain or a C-terminal FLAG peptide tag. At each junction
between two fragments (i.e., between the BC and DE fragments or between the
DE and FG fragments), each DNA fragment contained recognition sequences
for the Ear I Type IIS restriction endonuclease. This restriction enzyme
allowed the splicing together of adjacent fragments while removing all foreign,
non-¹⁰Fn3, sequences. It also allows for a recombination-like mixing of the
three ¹⁰Fn3 fragments between cycles of mutagenesis and selection.
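
As a rough illustration of how a Type IIS site can be located in one of the fragment oligonucleotides, the sketch below scans HFnLDETop (SEQ ID NO: 5) for Ear I sites on both strands. It is not part of the described protocol. The recognition sequence 5'-CTCTTC-3' and the cut offsets (1 nt downstream on the top strand, 4 nt on the bottom strand) are assumptions taken from general enzyme references rather than from this text, and the function names are hypothetical.

```python
# Illustrative sketch only: locate Ear I recognition sites in an oligonucleotide.
# Assumption (general knowledge, not stated in this text): Ear I recognizes
# 5'-CTCTTC-3' and cuts 1 nt (top strand) / 4 nt (bottom strand) downstream.

EAR1_SITE = "CTCTTC"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_sites(seq: str, site: str = EAR1_SITE):
    """Return 0-based start positions of the site on the forward and reverse strands."""
    rc_site = reverse_complement(site)
    last = len(seq) - len(site) + 1
    forward = [i for i in range(last) if seq.startswith(site, i)]
    reverse = [i for i in range(last) if seq.startswith(rc_site, i)]
    return forward, reverse


def top_strand_cut(site_start: int) -> int:
    """Index of the first base downstream of the top-strand cut (forward site)."""
    return site_start + len(EAR1_SITE) + 1


# HFnLDETop (SEQ ID NO: 5) with the spacing removed.
HFNLDETOP = (
    "GGAATTCCTAATACGACTCACTATAGGGACAATTACTATTTACAATTACAATG"
    "CATCACCATCACCATCACCTCTTCACAGGAGGAAATAGCCCTGTCC"
)

forward_sites, reverse_sites = find_sites(HFNLDETOP)
for start in forward_sites:
    cut = top_strand_cut(start)
    print(f"site at {start}, top strand continues with {HFNLDETOP[cut:cut + 9]}")
```

Because the cut falls outside the recognition sequence, the site itself is lost from the ligated product, which is consistent with the removal of foreign sequences described above.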

Each fragment was assembled from two overlapping
oligonucleotides, which were first annealed, then extended to form the
double-stranded DNA form of the fragment. The oligonucleotides that were
used to construct and process the three fragments are listed below; the "Top"
and "Bottom" species for each fragment are the oligonucleotides that contained
the entire ¹⁰Fn3 encoding sequence. In these oligonucleotide designations,
"N" indicates A, T, C, or G; and "S" indicates C or G.

HFnLBCTop (His):

5'- GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA
TTT ACA ATT ACA ATG CAT CAC CAT CAC CAT CAC GTT TCT GAT
GTT CCG AGG GAC CTG GAA GTT GTT GCT GCG ACC CCC ACC
AGC-3' (SEQ ID NO: 1)

HFnLBCTop (an alternative N-terminus):

5'- GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA
TTT ACA ATT ACA ATG GTT TCT GAT GTT CCG AGG GAC CTG
GAA GTT GTT GCT GCG ACC CCC ACC AGC-3' (SEQ ID NO: 2)

HFnLBCBot-flag8:

5'-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC GCT CTT
CCC TGT TTC TCC GTA AGT GAT CCT GTA ATA TCT (SNN)7 CCA
GCT GAT CAG TAG GCT GGT GGG GGT CGC AGC -3' (SEQ ID NO: 3)

HFnBC3'-flag8:

5'-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC GCT CTT
CCC TGT TTC TCC GTA AGT GAT CC-3' (SEQ ID NO: 4)

HFnLDETop:

5'- GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA
TTT ACA ATT ACA ATG CAT CAC CAT CAC CAT CAC CTC TTC ACA
GGA GGA AAT AGC CCT GTC C-3' (SEQ ID NO: 5)

HFnLDEBot-flag8:

5'-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC GCT CTT
CGT ATA ATC AAC TCC AGG TTT AAG GCC GCT GAT GGT AGC
TGT (SNN)4 AGG CAC AGT GAA CTC CTG GAC AGG GCT ATT TCC
TCC TGT -3' (SEQ ID NO: 6)

HFnDE3'-flag8:

5'-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC GCT CTT
CGT ATA ATC AAC TCC AGG TTT AAG G-3' (SEQ ID NO: 7)

HFnLFGTop:

5'- GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA
TTT ACA ATT ACA ATG CAT CAC CAT CAC CAT CAC CTC TTC TAT
ACC ATC ACT GTG TAT GCT GTC-3' (SEQ ID NO: 8)

HFnLFGBot-flag8:

5'-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC TGT TCG
GTA ATT AAT GGA AAT TGG (SNN)10 AGT GAC AGC ATA CAC AGT
GAT GGT ATA -3' (SEQ ID NO: 9)

HFnFG3'-flag8:

5'-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC TGT TCG
GTA ATT AAT GGA AAT TGG -3' (SEQ ID NO: 10)

T7Tmv (introduces T7 promoter and TMV untranslated region needed for in
vitro translation):

5'- GCG TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT ACA
ATT ACA-3' (SEQ ID NO: 11)

ASAflag8:

5'-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC-3' (SEQ ID
NO: 12)

Unisplint-S (splint oligonucleotide used to ligate mRNA to the
puromycin-containing linker, described by Roberts et al., 1997, supra):

5'-TTTTTTTTTNAGCGGATGC-3' (SEQ ID NO: 13)

A18--2PEG (DNA-puromycin linker):

5'-(A)18(PEG)2CCPur (SEQ ID NO: 14)
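
The relationship between the "Top" and "Bottom" oligonucleotides listed above can be checked with a short sketch. It uses only the 3'-terminal portions of HFnLBCTop and HFnLBCBot-flag8 as printed in the list, and it shows two things: the two oligonucleotides share a complementary stretch at their 3' ends (the region that anneals before extension), and the (SNN)7 run on the bottom strand reads as (NNS)7 on the coding strand. The helper names are hypothetical, and only the ambiguity codes used in this document (S and N) are handled.

```python
# Illustrative sketch: 3' overlap between a Top/Bottom oligo pair, and the
# coding-strand reading of the degenerate (SNN)n run.
# S (C or G) and N (any base) are their own complements.

COMPLEMENT = str.maketrans("ACGTSN", "TGCASN")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence that may contain S and N."""
    return seq.translate(COMPLEMENT)[::-1]


def three_prime_overlap(top: str, bottom: str) -> int:
    """Length of the longest 3' end of `top` that base-pairs with the 3' end of `bottom`."""
    rc_bottom = reverse_complement(bottom)
    for k in range(min(len(top), len(rc_bottom)), 0, -1):
        if top[-k:] == rc_bottom[:k]:
            return k
    return 0


# 3'-terminal portions only, copied from SEQ ID NO: 1 and SEQ ID NO: 3.
TOP_3P = "GAAGTTGTTGCTGCGACCCCCACCAGC"
BOTTOM_3P = "CCAGCTGATCAGTAGGCTGGTGGGGGTCGCAGC"

overlap = three_prime_overlap(TOP_3P, BOTTOM_3P)
coding_run = reverse_complement("SNN" * 7)

print("annealing overlap (nt):", overlap)
print("coding-strand randomized run:", coding_run)
```

Writing the randomized codons as SNN on the bottom strand is therefore equivalent to NNS codons in the translated strand.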

The pairs of oligonucleotides (500 pmol of each) were annealed in
100 µL of 10 mM Tris 7.5, 50 mM NaCl for 10 minutes at 85°C, followed by a
slow (0.5-1 hour) cooling to room temperature. The annealed fragments with
single-stranded overhangs were then extended using 100 U Klenow (New
England Biolabs, Beverly, MA) for each 100 µL aliquot of annealed oligos,
and the buffer made of 838.5 µl H2O, 9 µl 1 M Tris 7.5, 5 µl 1 M MgCl2, 20 µl
10 mM dNTPs, and 7.5 µl 1 M DTT. The extension reactions proceeded for 1
hour at 25°C.
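
For orientation, the component concentrations in the extension buffer can be worked out from the volumes given above. This is a minimal sketch that assumes the listed volumes are simply mixed together and ignores any further dilution by the annealed oligonucleotides or the enzyme; the variable names are illustrative.

```python
# Illustrative arithmetic: concentrations in the Klenow extension buffer as mixed.
# Assumption: the five listed volumes are combined with nothing else added.

WATER_UL = 838.5

# component name -> (volume in microliters, stock concentration in mM)
COMPONENTS = {
    "Tris 7.5": (9.0, 1000.0),
    "MgCl2": (5.0, 1000.0),
    "dNTPs": (20.0, 10.0),
    "DTT": (7.5, 1000.0),
}

total_ul = WATER_UL + sum(volume for volume, _ in COMPONENTS.values())

final_mM = {
    name: volume * stock_mM / total_ul
    for name, (volume, stock_mM) in COMPONENTS.items()
}

print(f"total volume: {total_ul:.1f} uL")
for name, conc in final_mM.items():
    print(f"{name}: {conc:.2f} mM")
```

Under that assumption the mix comes to 880 µl, with Tris at roughly 10 mM.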

Next, each of the double-stranded fragments was transformed into an
RNA-protein fusion (PROfusion™) using the technique developed by Szostak
et al., U.S.S.N. 09/007,005 and U.S.S.N. 09/247,190; Szostak et al.,
WO98/31700; and Roberts & Szostak, Proc. Natl. Acad. Sci. USA (1997) vol.
94, p. 12297-12302. Briefly, the fragments were transcribed using an Ambion
in vitro transcription kit, MEGAshortscript (Ambion, Austin, TX), and the
resulting mRNA was gel-purified and ligated to a DNA-puromycin linker
using DNA ligase. The mRNA-DNA-puromycin molecule was then translated
using the Ambion rabbit reticulocyte lysate-based translation kit. The resulting
mRNA-DNA-puromycin-protein PROfusion™ was purified using Oligo(dT)
cellulose, and a complementary DNA strand was synthesized using reverse
transcriptase and the RT primers described above (Unisplint-S or flagASA),
following the manufacturer's instructions.

The PROfusion™ obtained for each fragment was next purified on
the resin appropriate to its peptide purification tag, i.e., on Ni-NTA agarose for
the His6-tag and M2 agarose for the FLAG-tag, following the procedure
recommended by the manufacturer. The DNA component of the tag-binding
PROfusions™ was amplified by PCR using Pharmacia Ready-to-Go PCR
Beads, 10 pmol of 5' and 3' PCR primers, and the following PCR program
(Pharmacia, Piscataway, NJ): Step 1: 95°C for 3 minutes; Step 2: 95°C for 30
seconds, 58/62°C for 30 seconds, 72°C for 1 minute, 20/25/30 cycles, as
required; Step 3: 72°C for 5 minutes; Step 4: 4°C until end.

The resulting DNA was cleaved by 5 U Ear I (New England Biolabs)
per 1 µg DNA; the reaction took place in T4 DNA Ligase Buffer (New England
Biolabs) at 37°C, for 1 hour, and was followed by an incubation at 70°C for 15
minutes to inactivate Ear I. Equal amounts of the BC, DE, and FG fragments
were combined and ligated to form a full-length ¹⁰Fn3 gene with randomized
loops. The ligation required 10 U of fresh Ear I (New England Biolabs) and 20
U of T4 DNA Ligase (Promega, Madison, WI), and took 1 hour at 37°C.

Three different libraries were made in the manner described above.
Each contained the form of the FG loop with 10 randomized residues. The BC
and the DE loops of the first library bore the wild type ¹⁰Fn3 sequence; a BC
loop with 7 randomized residues and a wild type DE loop made up the second
library; and a BC loop with 7 randomized residues and a DE loop with 4
randomized residues made up the third library. The complexity of the FG loop
in each of these three libraries was 10¹³; the further two randomized loops
provided the potential for a complexity too large to be sampled in a laboratory.
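
The scale of these numbers can be illustrated with a short calculation. The sketch below counts theoretical protein-level diversity as 20 raised to the number of randomized residues, which is an idealization: it ignores codon bias, stop codons, and the much smaller number of molecules physically present in a reaction. The library labels are illustrative.

```python
# Illustrative arithmetic: theoretical sequence diversity for randomized loops,
# counting 20 possible amino acids per randomized position.
import math

AMINO_ACIDS = 20

# library label -> randomized residues in the (BC, DE, FG) loops
LIBRARIES = {
    "FG only": (0, 0, 10),
    "BC + FG": (7, 0, 10),
    "BC + DE + FG": (7, 4, 10),
}


def protein_diversity(randomized_positions: int) -> int:
    """Number of distinct amino acid sequences for the given number of positions."""
    return AMINO_ACIDS ** randomized_positions


for label, loops in LIBRARIES.items():
    diversity = protein_diversity(sum(loops))
    print(f"{label}: about 10^{math.log10(diversity):.1f} sequences")
```

Ten randomized FG positions alone give about 10^13 possible sequences, while adding the BC and DE loops pushes the theoretical total past 10^27.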

The three libraries constructed were combined into one master
library in order to simplify the selection process; target binding itself was
expected to select the most suitable library for a particular challenge.
PROfusions™ were obtained from the master library following the general
procedure described in Szostak et al., U.S.S.N. 09/007,005 and 09/247,190;
Szostak et al., WO98/31700; and Roberts & Szostak, Proc. Natl. Acad. Sci.
USA (1997) vol. 94, p. 12297-12302 (Figure 8).

Fusion Selections 

The master library in the PROfusion™ form was subjected to
selection for binding to TNF-α. Two protocols were employed: one in which
the target was immobilized on an agarose column and one in which the target
was immobilized on a BIACORE chip. First, an extensive optimization of
conditions to minimize background binders to the agarose column yielded the
favorable buffer conditions of 50 mM HEPES pH 7.4, 0.02% Triton, 100
µg/ml sheared salmon sperm DNA. In this buffer, the non-specific binding of
the ¹⁰Fn3 RNA fusion to TNF-α Sepharose was 0.3%. The non-specific
binding background of the ¹⁰Fn3 RNA-DNA to TNF-α Sepharose was found
to be 0.1%.

During each round of selection on TNF-α Sepharose, the
PROfusion™ library was first preincubated for an hour with underivatized
Sepharose to remove any remaining non-specific binders; the flow-through
from this pre-clearing was incubated for another hour with TNF-α Sepharose.
The TNF-α Sepharose was washed for 3-30 minutes.

After each selection, the PROfusion™ DNA that had been eluted
from the solid support with 0.3 M NaOH or 0.1 M KOH was amplified by
PCR; a DNA band of the expected size persisted through multiple rounds of
selection (Figure 9). Similar results were observed in the two alternative
selection protocols, and only the data from the agarose column selection are
shown in Figure 9.

In the first seven rounds, the binding of library PROfusions™ to the
target remained low; in contrast, when free protein was translated from DNA
pools at different stages of the selection, the proportion of the column binding
species increased significantly between rounds (Figure 10). Similar selections
may be carried out with any other binding species target (for example, IL-1 and
IL-13).

Animal Studies 

Wild-type ¹⁰Fn3 contains an integrin-binding tripeptide motif,
Arginine 78 - Glycine 79 - Aspartate 80 (the "RGD motif") at the tip of the FG
loop. In order to avoid integrin binding and a potential inflammatory response
based on this tripeptide in vivo, a mutant form of ¹⁰Fn3 was generated that
contained an inert sequence, Serine 78 - Glycine 79 - Glutamate 80 (the "SGE
mutant"), a sequence which is found in the closely related, wild-type ¹¹Fn3
domain. This SGE mutant was expressed as an N-terminally His6-tagged, free
protein in E. coli, and purified to homogeneity on a metal chelate column
followed by a size exclusion column.

In particular, the DNA sequence encoding His6-¹⁰Fn3(SGE) was
cloned into the pET9a expression vector and transformed into BL21 DE3
pLysS cells. The culture was then grown in LB broth containing 50 µg/mL
kanamycin at 37°C, with shaking, to A560 = 1.0, and was then induced with 0.4
mM IPTG. The induced culture was further incubated, under the same
conditions, overnight (14-18 hours); the bacteria were recovered by standard,
low speed centrifugation. The cell pellet was resuspended in 1/50 of the
original culture volume of lysis buffer (50 mM Tris 8.0, 0.5 M NaCl, 5%
glycerol, 0.05% Triton X-100, and 1 mM PMSF), and the cells were lysed by
passing the resulting paste through a Microfluidics Corporation Microfluidizer
M110-EH, three times. The lysate was clarified by centrifugation, and the
supernatant was filtered through a 0.45 µm filter followed by filtration through
a 0.2 µm filter. 100 mL of the clarified lysate was loaded onto a 5 mL Talon
cobalt column (Clontech, Palo Alto, CA), washed with 70 mL of lysis buffer,
and eluted with a linear gradient of 0-30 mM imidazole in lysis buffer. The
flow rate through the column through all the steps was 1 mL/min. The eluted
protein was concentrated 10-fold by dialysis (MW cutoff = 3,500) against
15,000-20,000 PEG. The resulting sample was dialyzed into buffer 1 (lysis
buffer without the glycerol), then loaded, 5 mL at a time, onto a 16 x 60 mm
Sephacryl 100 size exclusion column equilibrated in buffer 1. The column was
run at 0.8 mL/min, in buffer 1; all fractions that contained a protein of the
expected MW were pooled, concentrated 10X as described above, then
dialyzed into PBS. Toxikon (MA) was engaged to perform endotoxin screens
and animal studies on the resulting sample.

In these animal studies, the endotoxin levels in the samples
examined to date have been below the detection level of the assay. In a
preliminary toxicology study, this protein was injected into two mice at the
estimated 100X therapeutic dose of 2.6 mg/mouse. The animals survived the
two weeks of the study with no apparent ill effects. These results suggest that
¹⁰Fn3 may be incorporated safely into an IV drug.

Alternative Constructs for In Vivo Use

To extend the half-life of the 8 kD ¹⁰Fn3 domain, a larger molecule
has also been constructed that mimics natural antibodies. This ¹⁰Fn3-Fc
molecule contains the -CH1-CH2-CH3 (Figure 11) or -CH2-CH3 domains of the
IgG constant region of the host; in these constructs, the ¹⁰Fn3 domain is grafted
onto the N-terminus in place of the IgG VH domain (Figures 11 and 12). Such
antibody-like constructs are expected to improve the pharmacokinetics of the
protein as well as its ability to harness the natural immune response.

In order to construct the murine form of the ¹⁰Fn3-CH1-CH2-CH3
clone, the -CH1-CH2-CH3 region was first amplified from a mouse liver spleen
cDNA library (Clontech), then ligated into the pET25b vector. The primers
used in the cloning were 5' Fc Nest and 3' Fc Nest, and the primers used to
graft the appropriate restriction sites onto the ends of the recovered insert were
5' Fc Hin and 3' Fc Nhe:



5' Fc Nest 5' GCG GCA GGG TTT GCT TAG TGG GGC CAA GGG 3' (SEQ
ID NO: 15);

3' Fc Nest 5' GGG AGG GGT GGA GGT AGG TCA GAG TCC 3' (SEQ ID
NO: 16);

3' Fc Nhe 5' TTT GCT AGG TTT ACC AGG AGA GTG GGA GGC 3' (SEQ
ID NO: 17); and

5' Fc Hin 5' AAA AAG CTT GCC AAA AGG ACA CCC CCA TCT GTC 3'
(SEQ ID NO: 18).



Further PCR is used to remove the CH1 region from this clone and
create the Fc part of the shorter, ¹⁰Fn3-CH2-CH3 clone. The sequence
encoding ¹⁰Fn3 is spliced onto the 5' end of each clone; either the wild type
¹⁰Fn3 cloned from the same mouse spleen cDNA library or a modified ¹⁰Fn3
obtained by mutagenesis or randomization of the molecules can be used. The
oligonucleotides used in the cloning of murine wild-type ¹⁰Fn3 were:

Mo 5PCR-NdeI:

5' CATATGGTTTCTGATATTCCGAGAGATCTGGAG 3' (SEQ ID NO:
19);

Mo5PCR-His-NdeI (for an alternative N-terminus with the His6
purification tag):

5' CAT ATG CAT CAC CAT CAC CAT CAC GTT TCT GAT
ATT CCG AGA G 3' (SEQ ID NO: 20); and

Mo3PCR-EcoRI:

5' GAATTCCTATGTTTTATAATTGATGGAAAC 3' (SEQ ID NO: 21).

The human equivalents of the clones are constructed using the same 
strategy with human oligonucleotide sequences. 

¹⁰Fn3 Scaffolds in Protein Chip Applications

The suitability of the ¹⁰Fn3 scaffold for protein chip applications is
the consequence of (1) its ability to support many binding functions which can
be selected rapidly on the bench or in an automated setup, and (2) its superior
biophysical properties.

The versatile binding properties of ¹⁰Fn3 are a function of the loops
displayed by the Fn3 immunoglobulin-like, beta sandwich fold. As discussed
above, these loops are similar to the complementarity determining regions of
antibody variable domains and can cooperate in a way similar to those antibody
loops in order to bind antigens. In our system, ¹⁰Fn3 loops BC (residues
21-30), DE (residues 51-56), and FG (residues 76-87) are randomized either in
sequence, in length, or in both sequence and length in order to generate diverse
libraries of mRNA-¹⁰Fn3 fusions. The binders in such libraries are then
enriched based on their affinity for an immobilized or tagged target, until a
small population of high affinity binders is generated. Also, error-prone PCR
and recombination can be employed to facilitate affinity maturation of selected
binders. Due to the rapid and efficient selection and affinity maturation
protocols, binders to a large number of targets can be selected in a short time.

As a scaffold for binders to be immobilized on protein chips, the
¹⁰Fn3 domain has the advantage over antibody fragments and single-chain
antibodies of being smaller and easier to handle. For example, unlike
single-chain scaffolds or isolated variable domains of antibodies, which vary
widely in their stability and solubility, and which require an oxidizing
environment to preserve their structurally essential disulfide bonds, ¹⁰Fn3 is
extremely stable, with a melting temperature of 110°C, and solubility at a
concentration > 16 mg/mL. The ¹⁰Fn3 scaffold also contains no disulfides or
free cysteines; consequently, it is insensitive to the redox potential of its
environment. A further advantage of ¹⁰Fn3 is that its antigen-binding loops
and N-terminus are on the edge of the beta-sandwich opposite to the
C-terminus; thus the attachment of a ¹⁰Fn3 scaffold to a chip by its C-terminus
aligns the antigen-binding loops, allowing for their greatest accessibility to the
solution being assayed. Since ¹⁰Fn3 is a single domain of only 94 amino acid
residues, it is also possible to immobilize it onto a chip surface at a higher
density than is used for single-chain antibodies, with their approximately 250
residues. In addition, the hydrophilicity of the ¹⁰Fn3 scaffold, which is
reflected in the high solubility of this domain, leads to a lower than average
background binding of ¹⁰Fn3 to a chip surface.

The stability of the ¹⁰Fn3 scaffold as well as its suitability for library
formation and selection of binders are likely to be shared by the large, Fn3-like
class of protein domains with an immunoglobulin-like fold, such as the
domains of tenascin, N-cadherin, E-cadherin, ICAM, titin, GCSF-R, cytokine
receptor, glycosidase inhibitor, and antibiotic chromoprotein. The key features
shared by all such domains are a stable framework provided by two
beta-sheets, which are packed against each other and which are connected by at
least three solvent-accessible loops per edge of the sheet; such loops can be
randomized to generate a library of potential binders without disrupting the
structure of the framework (as described above).

Immobilization of Fibronectin Scaffold Binders (Fn-binders) 

To immobilize Fn-binders to a chip surface, a number of exemplary
techniques may be utilized. For example, Fn-binders may be immobilized as
RNA-protein fusions by Watson-Crick hybridization of the RNA moiety of the
fusion to a base complementary DNA immobilized on the chip surface (as
described, for example, in Addressable Protein Arrays, U.S.S.N. 60/080,686;
U.S.S.N. 09/282,734; and WO 99/51773). Alternatively, Fn-binders can be
immobilized as free proteins directly on a chip surface. Manual as well as
robotic devices may be used for deposition of the Fn-binders on the chip
surface. Spotting robots can be used for deposition of Fn-binders with high
density in an array format (for example, by the method of Lueking et al., Anal.
Biochem. 1999 May 15;270(1):103-11). Different methods may also be
utilized for anchoring the Fn-binder on the chip surface. A number of standard
immobilization procedures may be used including those described in Methods
in Enzymology (K. Mosbach and B. Danielsson, eds.), vols. 135 and 136,
Academic Press, Orlando, Florida, 1987; Nilsson et al., Protein Expr. Purif.
1997 Oct;11(1):1-16; and references therein. Oriented immobilization of
Fn-binders can help to increase the binding capacity of chip-bound Fn-binders.
Exemplary approaches for achieving oriented coupling are described in Lu et
al., The Analyst (1996), vol. 121, p. 29R-32R; and Turkova, J Chromatogr B
Biomed Sci App. 1999 Feb 5;722(1-2):11-31. In addition, any of the methods
described herein for anchoring Fn-binders to chip surfaces can also be applied
to the immobilization of Fn-binders on beads, or other supports.

Target Protein Capture and Detection 

Selected populations of Fn-binders may be used for detection and/or
quantitation of analyte targets, for example, in samples such as biological
samples. To carry out this type of diagnostic assay, selected Fn-binders to
targets of interest are immobilized on an appropriate support to form
multi-featured protein chips. Next, a sample is applied to the chip, and the
components of the sample that associate with the Fn-binders are identified
based on the target-specificity of the immobilized binders. Using this
technique, one or more components may be simultaneously identified or
quantitated in a sample (for example, as a means to carry out sample profiling).

Methods for target detection allow measuring the levels of bound
protein targets and include, without limitation, radiography, fluorescence
scanning, mass spectroscopy (MS), and surface plasmon resonance (SPR).
Autoradiography using a phosphorimager system (Molecular Dynamics,
Sunnyvale, CA) can be used for detection and quantification of target protein
which has been radioactively labeled, e.g., using ³⁵S methionine. Fluorescence
scanning using a laser scanner (see below) may be used for detection and
quantification of fluorescently labeled targets. Alternatively, fluorescence
scanning may be used for the detection of fluorescently labeled ligands which
themselves bind to the target protein (e.g., fluorescently labeled target-specific
antibodies or fluorescently labeled streptavidin binding to target-biotin, as
described below).

Mass spectroscopy can be used to detect and identify bound targets
based on their molecular mass. Desorption of bound target protein can be
achieved with laser assistance directly from the chip surface as described
below. Mass detection also allows determinations, based on molecular mass,
of target modifications including post-translational modifications like
phosphorylation or glycosylation. Surface plasmon resonance can be used for
quantification of bound protein targets where the Fn-binder(s) are immobilized
on a suitable gold-surface (for example, as obtained from Biacore, Sweden).

Described below are exemplary schemes for selecting Fn-binders (in
this case, Fn-binders specific for the protein, TNF-α) and the use of those
selected populations for detection on chips. This example is provided for the
purpose of illustrating the invention, and should not be construed as limiting.

Selection of TNF-α Binders Based on ¹⁰Fn3 Scaffold

In one exemplary use for fibronectin scaffold selection on chips, a
¹⁰Fn3-based selection was performed against TNF-α, using a library of human
¹⁰Fn3 variants with randomized loops BC, DE, and FG. The library was
constructed from three DNA fragments, each of which contained nucleotide
sequences that encoded approximately one third of human ¹⁰Fn3, including one
of the randomized loops. The DNA sequences that encoded the loop residues
listed above were rebuilt by oligonucleotide synthesis, so that the codons for
the residues of interest were replaced by (NNS)n, where N represents any of
the four deoxyribonucleotides (A, C, G, or T), and S represents either C or G.
The C-terminus of each fragment contained the sequence for the FLAG
purification tag.
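
A short enumeration makes the consequence of the NNS scheme concrete: it lists every NNS codon and translates it with the standard genetic code. The codon table is the standard one (general knowledge, not taken from this text), written compactly in T, C, A, G order, and the variable names are illustrative.

```python
# Illustrative sketch: enumerate all NNS codons (N = A/C/G/T, S = C/G) and
# translate them with the standard genetic code.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# All codons with any base at positions 1 and 2 and C or G at position 3.
nns_codons = [a + b + s for a, b, s in product("ACGT", "ACGT", "CG")]

encoded = {CODON_TABLE[codon] for codon in nns_codons} - {"*"}
stop_codons = [codon for codon in nns_codons if CODON_TABLE[codon] == "*"]

print(len(nns_codons), "codons,", len(encoded), "amino acids, stops:", stop_codons)
```

Restricting the third position to C or G keeps all twenty amino acids available while excluding two of the three stop codons, which limits the fraction of truncated library members.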

Once extended by Klenow, each DNA fragment was transcribed,
ligated to a puromycin-containing DNA linker, and translated in vitro, as
described by Szostak et al. (Roberts and Szostak, Proc. Natl. Acad. Sci. USA
94:12297, 1997; Szostak et al., U.S.S.N. 09/007,005 and U.S.S.N. 09/247,190;
Szostak et al., WO98/31700), to generate an mRNA-peptide fusion, which was
then reverse-transcribed into a DNA-mRNA-peptide fusion. The binding of
the FLAG-tagged peptide to M2 agarose separated full-length fusion
molecules from those containing frameshifts or superfluous stop codons; the
DNA associated with the purified full-length fusion was amplified by PCR,
then the three DNA fragments were cut by Ear I restriction endonuclease and
ligated to form the full length template. The template was transcribed, ligated
to puromycin-containing DNA linkers, and translated to generate a
¹⁰Fn3-PROfusion™ library, which was then reverse-transcribed to yield the
DNA-mRNA-peptide fusion library which was subsequently used in the
selection.

Selection for TNF-α binders took place in 50 mM HEPES, pH 7.4,
0.02% Triton-X, 0.1 mg/mL salmon sperm DNA. The PROfusion™ library
was incubated with Sepharose-immobilized TNF-α; after washing, the DNA
associated with the tightest binders was eluted with 0.1 M KOH, amplified by
PCR, and transcribed, ligated, translated, and reverse-transcribed into the
starting material for the next round of selection.

Ten rounds of such selection were performed (as shown in Figure
13); they resulted in a PROfusion™ pool that bound to TNF-α-Sepharose with
an apparent average Kd of 120 nM. Specific clonal components of the pool
that were characterized showed TNF-α binding in the range of 50-500 nM.
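
As a rough aid to interpreting these dissociation constants, the sketch below evaluates the fraction of binder occupied at equilibrium. It assumes simple 1:1 binding with target in large excess over binder, which the text does not state; fraction_bound is a hypothetical helper, and the 100 nM target concentration is chosen only for illustration.

```python
def fraction_bound(target_nM: float, kd_nM: float) -> float:
    """Equilibrium occupancy for 1:1 binding with target in excess: [T] / (Kd + [T])."""
    if target_nM < 0 or kd_nM <= 0:
        raise ValueError("target must be non-negative and Kd must be positive")
    return target_nM / (kd_nM + target_nM)

# Kd values reported above: pool average 120 nM, individual clones 50-500 nM.
for kd in (50.0, 120.0, 500.0):
    occupancy = fraction_bound(100.0, kd)
    print(f"Kd = {kd:5.0f} nM -> fraction bound at 100 nM target: {occupancy:.2f}")
```

Under this model, occupancy is one half when the target concentration equals Kd, so a binder at the tight end of the reported range is substantially more occupied than one at the weak end at the same target concentration.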
Fn-binder Immobilization, Target Protein Capture, and MALDI-TOF Detection

As a first step toward immobilizing the Fn-binders to a chip surface,
an oligonucleotide capture probe was prepared with an automated DNA
synthesizer (PE BioSystems Expedite 8909) using the solid-support
phosphoramidite approach. All reagents were obtained from Glen Research.
Synthesis was initiated with a solid support containing a disulfide bond to
eventually provide a 3'-terminal thiol functionality. The first four monomers to
be added were hexaethylene oxide units, followed by 20 T monomers. The
5'-terminal DMT group was not removed. The capture probe was cleaved
from the solid support and deprotected with ammonium hydroxide,
concentrated to dryness in a vacuum centrifuge, and purified by reverse-phase
HPLC using an acetonitrile gradient in triethylammonium acetate buffer.
Appropriate fractions from the HPLC were collected, evaporated to dryness in
a vacuum centrifuge, and the 5'-terminal DMT group was removed by
treatment with 80% AcOH for 30 minutes. The acid was removed by
evaporation, and the oligonucleotide was then treated with 100 mM DTT for
30 minutes to cleave the disulfide bond. DTT was removed by repeated
extraction with EtOAc. The oligonucleotide was ethanol precipitated from the
remaining aqueous layer and checked for purity by reverse-phase HPLC.

The 3'-thiol capture probe was adjusted to 250 µM in degassed 1X
PBS buffer and applied as a single droplet (75 µL) to a 9x9 mm gold-coated
chip (Biacore) in an argon-flushed chamber containing a small amount of
water. After 18 hours at room temperature, the capture probe solution was
removed, and the functionalized chip was washed with 50 mL 1X PBS buffer
(2x for 15 minutes each) with gentle agitation, and then rinsed with 50 mL
water (2x for 15 minutes each) in the same fashion. Remaining liquid was
carefully removed and the functionalized chips were either used immediately
or stored at 4°C under argon.

About 1 pmol of ¹⁰Fn3 fusion pool from the Round 10 TNF-α
selection (above) was treated with RNase A for several hours, adjusted to 5X
SSC in 70 µL, and applied to a functionalized gold chip from above as a single
droplet. A 50 µL volume gasket device was used to seal the fusion mixture
with the functionalized chip, and the apparatus was continuously rotated at
4°C. After 18 hours the apparatus was disassembled, and the gold chip was
washed with 50 mL 5X SSC for 10 minutes with gentle agitation. Excess
liquid was carefully removed from the chip surface, and the chip was
passivated with a blocking solution (1X TBS + 0.02% Tween-20 + 0.25%
BSA) for 10 minutes at 4°C. Excess liquid was carefully removed, and a
solution containing 500 µg/mL TNF-α in the same composition blocking
solution was applied to the chip as a single droplet and incubated at 4°C for
two hours with occasional mixing of the droplet via Pipetman. After removal
of the binding solution, the chip was washed for 5 minutes at 4°C with gentle
agitation (50 mL 1X TBS + 0.02% Tween-20) and then dried at room
temperature. A second chip was prepared exactly as described above, except
fusion was not added to the hybridization mix.

Next, MALDI-TOF matrix (15 mg/mL
3,5-dimethoxy-4-hydroxycinnamic acid in 1:1 ethanol/10% formic acid in
water) was uniformly applied to the gold chips with a high-precision 3-axis
robot (MicroGrid, BioRobotics). A 16-pin tool was used to transfer the matrix
from a 384-well microtiter plate to the chips, producing 200 micron diameter
features with a 600 micron pitch. The MALDI-TOF mass spectrometer
(Voyager DE, PerSeptive Biosystems) instrument settings were as follows:
Accelerating Voltage = 25k, Grid Voltage = 92%, Guide Wire Voltage =
0.05%, Delay = 200 on, Laser Power = 2400, Low Mass Gate = 1500,
Negative Ions = off. The gold chips were individually placed on a MALDI
sample stage modified to keep the level of the chip the same as the level of the
stage, thus allowing proper flight distance. The instrument's video monitor and
motion control system were used to direct the laser beam to individual matrix
features.

Figures 14 and 15 show the mass spectra from the ¹⁰Fn3 fusion chip
and the non-fusion chip, respectively. In each case, a small number of 200
micron features were analyzed to collect the spectra, but Figure 15 required
significantly more acquisitions. The signal at 17.5 kDa corresponds to TNF-α
monomer.
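
For orientation, the sketch below estimates the ideal drift velocity and flight time of a singly charged 17.5 kDa ion. It is an idealized linear time-of-flight calculation, not a model of the instrument used here: it reads the accelerating voltage setting of 25k as 25 kV, ignores delayed extraction and initial ion velocity, and uses an assumed 1.0 m drift length that is not a specification of the Voyager DE. The function names are hypothetical.

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19   # coulombs
DALTON_KG = 1.66053906660e-27           # kilograms per dalton

def ion_velocity(mass_da: float, accel_volts: float, charge: int = 1) -> float:
    """Velocity (m/s) from energy balance z*e*V = (1/2)*m*v^2."""
    mass_kg = mass_da * DALTON_KG
    return math.sqrt(2 * charge * ELEMENTARY_CHARGE_C * accel_volts / mass_kg)

def flight_time_us(mass_da: float, accel_volts: float, drift_m: float,
                   charge: int = 1) -> float:
    """Ideal field-free flight time in microseconds."""
    return drift_m / ion_velocity(mass_da, accel_volts, charge) * 1e6

ASSUMED_DRIFT_M = 1.0  # assumption for illustration only
v = ion_velocity(17_500, 25_000)
t = flight_time_us(17_500, 25_000, ASSUMED_DRIFT_M)
print(f"velocity ~ {v:.0f} m/s, flight time ~ {t:.1f} us over {ASSUMED_DRIFT_M} m")
```

Because flight time scales with the square root of mass-to-charge ratio, a change in the chip height alters the effective drift length and shifts the apparent mass, which is why the sample stage was modified to keep the chip level with the stage.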

Fn-binder Immobilization, Target Protein Capture, and Fluorescence Detection 

Pre-cleaned 1x3 inch glass microscope slides (Goldseal, #3010)
were treated with Nanostrip (Cyantek) for 15 minutes, 10% aqueous NaOH at
70°C for 3 minutes, and 1% aqueous HCl for 1 minute, thoroughly rinsing
with deionized water after each reagent. The slides were then dried in a
vacuum desiccator over anhydrous calcium sulfate for several hours. A 1%
solution of aminopropyltrimethoxysilane in 95% acetone / 5% water was
prepared and allowed to hydrolyze for 20 minutes. The glass slides were
immersed in the hydrolyzed silane solution for 5 minutes with gentle agitation.
Excess silane was removed by subjecting the slides to ten 5-minute washes,
using fresh portions of 95% acetone / 5% water for each wash, with gentle
agitation. The slides were then cured by heating at 110°C for 20 minutes. The
silane-treated slides were immersed in a freshly prepared 0.2% solution of
phenylene 1,4-diisothiocyanate in 90% DMF / 10% pyridine for two hours,
with gentle agitation. The slides were washed sequentially with 90% DMF /





WO 01/64942

PCT/US01/06414



10% pyridine, methanol, and acetone. After air drying, the functionalized 
slides were stored at 0°C in a vacuum desiccator over anhydrous calcium 
sulfate. Similar results were obtained with commercial amine-reactive slides 
(3-D Link, Surmodics). 
Oligonucleotide capture probes were prepared with an automated
DNA synthesizer (PE BioSystems Expedite 8909) using conventional
phosphoramidite chemistry. All reagents were from Glen Research. Synthesis
was initiated with a solid support bearing an orthogonally protected amino
functionality, whereby the 3'-terminal amine is not unmasked until the final
deprotection step. The first four monomers to be added were hexaethylene
oxide units, followed by the standard A, G, C and T monomers. All capture
oligo sequences were cleaved from the solid support and deprotected with
ammonium hydroxide, concentrated to dryness, precipitated in ethanol, and
purified by reverse-phase HPLC using an acetonitrile gradient in
triethylammonium acetate buffer. Appropriate fractions from the HPLC were
collected, evaporated to dryness in a vacuum centrifuge, and then coevaporated
with a portion of water.

The purified, amine-labeled capture oligos were adjusted to a
concentration of 250 µM in 50 mM sodium carbonate buffer (pH 9.0)
containing 10% glycerol. The probes were spotted onto the amine-reactive
glass surface at defined positions in a 5x5x6 array pattern with a 3-axis robot
(MicroGrid, BioRobotics). A 16-pin tool was used to transfer the liquid from
384-well microtiter plates, producing 200 micron features with a 600 micron
pitch. Each sub-grid of 24 features represents a single capture probe (i.e., 24
duplicate spots). The arrays were incubated at room temperature in a
moisture-saturated environment for 12-18 hours. The attachment reaction was
terminated by immersing the chips in 2% aqueous ammonium hydroxide for
five minutes with gentle agitation, followed by rinsing with distilled water (3X
for 5 minutes each). The array was finally soaked in 10X PBS solution for 30
minutes at room temperature, and then rinsed again for 5 minutes in distilled
water.

Specific and thermodynamically isoenergetic sequences along the
¹⁰Fn3 mRNA were identified to serve as capture points to self-assemble and
anchor the ¹⁰Fn3 protein. The software program HybSimulator v4.0
(Advanced Gene Computing Technology, Inc.) facilitated the identification
and analysis of potential capture probes. Six unique capture probes were
chosen and printed onto the chip, three of which are complementary to
common regions of the ¹⁰Fn3 fusion pool's mRNA (CP3', CP5', and CPflag).
The remaining three sequences (CPneg1, CPneg2, and CPneg3) are not
complementary and function in part as negative controls. Each of the capture
probes possesses a 3'-amino terminus and four hexaethylene oxide spacer units,
as described above. The following is a list of the capture probe sequences that
were employed (5'→3'):



CP3': TGTAAATAGTAATTGTCCC (SEQ ID NO: 22) 
CPS': TTTT'llllllllTTTTTTTT (SEQ ID NO: 23) 
CPneg1: CCTGTAGGTGTCCAT (SEQ ID NO: 24)
CPflag: CATCGTCCTTGTAGTC (SEQ ID NO: 25)
CPneg2: CGTCGTAGGGGTA (SEQ ID NO: 26) 
CPneg3: CAGGTCTTCTTCAGAGA (SEQ ID NO: 27) 
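
The relationship between a capture probe and its mRNA target can be illustrated with CPflag. The sketch below, added for illustration, reverse-complements the probe and translates the result under the standard genetic code; reverse_complement and translate are hypothetical helpers, and reading the target in the frame that starts at its first base is an assumption.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def translate(dna: str) -> str:
    """Translate complete codons from the first base; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

CPFLAG = "CATCGTCCTTGTAGTC"  # SEQ ID NO: 25, written 5'->3'
target = reverse_complement(CPFLAG)
print(target)             # sense-strand sequence (DNA alphabet) the probe pairs with
print(translate(target))  # peptide encoded by the complete codons
```

The translated prefix, DYKDD, matches the start of the FLAG epitope (DYKDDDDK), consistent with CPflag hybridizing to the FLAG-encoding region of the fusion mRNA.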



About 1 pmol of ¹⁰Fn3 fusion pool from the Round 10 TNF-α selection was
adjusted to 5X SSC containing 0.02% Tween-20 and 2 mM vanadyl
ribonucleotide complex in a total volume of 350 µL. The entire volume was
applied to the microarray under a 400 µL gasket device and the assembly was
continuously rotated for 18 hours at room temperature. After hybridization the
slide was washed sequentially with stirred 500 mL portions of 5X SSC, 2.5X
SSC, and 1X SSC for 5 minutes each. Traces of liquid were removed by
centrifugation and the slide was allowed to air-dry.

Recombinant human TNF-α (500 µg, lyophilized, from PeproTech)
was taken up in 230 µL 1X PBS and dialyzed against 700 mL stirred 1X PBS
at 4°C for 18 hours in a Microdialyzer unit (3,500 MWCO, Pierce). The
dialyzed TNF-α was treated with EZ-Link NHS-LC-LC biotinylation reagent
(20 µg, Pierce) for 2 hours at 0°C, and again dialyzed against 700 mL stirred
1X PBS at 4°C for 18 hours in a Microdialyzer unit (3,500 MWCO, Pierce).
The resulting conjugate was analyzed by MALDI-TOF mass spectrometry and
was found to be almost completely functionalized with a single biotin moiety.

Each of the following processes was conducted at 4°C with
continuous rotation or mixing. The protein microarray surface was passivated
by treatment with 1X TBS containing 0.02% Tween-20 and 0.2% BSA (200
µL) for 60 minutes. Biotinylated TNF-α (100 nM concentration made up in
the passivation buffer) was contacted with the microarray for 120 minutes.
The microarray was washed with 1X TBS containing 0.02% Tween-20 (3X 50
mL, 5 minutes each wash). Fluorescently labeled streptavidin (2.5 µg/mL
Alexa 546-streptavidin conjugate from Molecular Probes, made up in the
passivation buffer) was contacted with the microarray for 60 minutes. The
microarray was washed with 1X TBS containing 0.02% Tween-20 (2X 50 mL,
5 minutes each wash) followed by a 3 minute rinse with 1X TBS. Traces of
liquid were removed by centrifugation, and the slide was allowed to air-dry at
room temperature.
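
The quantities in this protocol can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the 17.5 kDa monomer mass cited above for the TNF-α signal in the mass spectra, ignores losses during dialysis and the added mass of the biotin label, and uses hypothetical helper names.

```python
def micromolar(mass_ug: float, volume_ul: float, mw_da: float) -> float:
    """Concentration in micromolar from mass (ug), volume (uL), and molecular weight (Da)."""
    micromoles = mass_ug / mw_da      # ug / (g/mol) = umol
    litres = volume_ul * 1e-6
    return micromoles / litres

def dilution_factor(stock_um: float, working_nm: float) -> float:
    """Fold dilution needed to go from a stock (uM) to a working concentration (nM)."""
    return stock_um * 1000.0 / working_nm

TNF_MONOMER_DA = 17_500  # assumed from the 17.5 kDa monomer signal

stock_um = micromolar(500, 230, TNF_MONOMER_DA)   # 500 ug taken up in 230 uL
fold = dilution_factor(stock_um, 100)             # 100 nM working solution
print(f"stock ~ {stock_um:.0f} uM monomer")
print(f"dilution to 100 nM ~ {fold:.0f}-fold")
```

On these assumptions the reconstituted stock is on the order of 0.1 mM in monomer, so only a small fraction of the material is needed to prepare the 100 nM binding solution.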
Fluorescence laser scanning was performed with a GSI Lumonics
ScanArray 5000 system using 10 µm pixel resolution and preset excitation and
emission wavelengths for Alexa 546 dye. Phosphorimage analysis was
performed with a Molecular Dynamics Storm system. Exposure time was 48
hours with direct contact between the microarray and the phosphor storage
screen. Phosphorimage scanning was performed at the 50 µm resolution
setting, and data were extracted with ImageQuant v.4.3 software.

Figures 16 and 17 are the phosphorimage and fluorescence scan,
respectively, of the same array. The phosphorimage shows where the ¹⁰Fn3
fusion hybridized based on the ³⁵S-methionine signal. The fluorescence scan
shows where the labeled TNF-α bound.



Other Embodiments 
Other embodiments are within the claims. 

All publications, patents, and patent applications mentioned herein
are hereby incorporated by reference.

What is claimed is:

Claims

1. An array of proteins immobilized on a solid support, each of said
proteins comprising a fibronectin type III domain having at least one
randomized loop, at least one randomized β-sheet, or a combination thereof,
and being characterized by its ability to bind to a compound that is not bound
by a corresponding naturally-occurring fibronectin.

2. The array of claim 1, wherein said fibronectin type III domain is a
mammalian fibronectin type III domain.

3. The array of claim 2, wherein said fibronectin type III domain is a
human fibronectin type III domain.

4. The array of claim 1, wherein each of said proteins comprises the
tenth module of said fibronectin type III domain (¹⁰Fn3).

5. The array of claim 4, wherein each of said proteins contains one,
two, or three randomized loops and wherein at least one of said loops
contributes to the binding of the protein to said compound.

6. The array of claim 5, wherein at least two of said randomized
loops contribute to said binding of the protein to said compound.

7. The array of claim 6, wherein at least three of said randomized
loops contribute to said binding of the protein to said compound.






8. The array of claim 4, wherein said ¹⁰Fn3 lacks an integrin-binding
motif.

9. The array of claim 1, wherein each of said proteins lacks
disulfide bonds.

10. The array of claim 1, wherein each of said proteins is a
monomer or a dimer.

11. The array of claim 1, wherein each of said proteins is covalently
bound to a nucleic acid.

12. The array of claim 11, wherein said nucleic acid encodes the
covalently bound protein.

13. The array of claim 12, wherein said nucleic acid is RNA.

14. The array of claim 1, wherein said solid support is a chip.

15. A method for obtaining a protein which binds to a compound,
said method comprising:

(a) contacting said compound with an array of candidate proteins
immobilized on a solid support, each of said candidate proteins comprising a
fibronectin type III domain having at least one randomized loop, one
randomized β-sheet, or a combination thereof, said contacting being carried out
under conditions that allow compound-protein complex formation; and

(b) obtaining, from said complex, a protein which binds to said
compound.

16. A method for obtaining a compound which binds to a protein,
said protein comprising a fibronectin type III domain having at least one
randomized loop, at least one randomized β-sheet, or a combination thereof,
said method comprising:

(a) contacting an array of proteins immobilized on a solid support
with a candidate compound, each of said proteins comprising a fibronectin
type III domain having at least one randomized loop, one randomized β-sheet,
or a combination thereof, said contacting being carried out under conditions
that allow compound-protein complex formation; and

(b) obtaining, from said complex, a compound which binds to a
protein of the array.

17. The method of claim 15, said method further comprising the
steps of:

(c) further randomizing a protein which binds to said compound in
step (b);

(d) forming an array on a solid support with the further randomized
proteins of step (c); and

(e) repeating steps (a) and (b) using, in step (a), the array of further
randomized proteins as said array of candidate proteins.

18. The method of claim 16, said method further comprising the
steps of:

(c) modifying the compound which binds to said protein in step (b);
and

(d) repeating steps (a) and (b) using, in step (a), said further
modified compound as said candidate compound.

19. The method of claim 15 or 16, wherein said solid support is a
chip.

20. A method for detecting a compound in a sample, said method
comprising:

(a) contacting a sample with a protein which binds to said compound
and which comprises a fibronectin type III domain having at least one
randomized loop, at least one randomized β-sheet, or a combination thereof,
said contacting being carried out under conditions that allow compound-protein
complex formation; and

(b) detecting said complex, thereby detecting said compound in said
sample.

21. The method of claim 20, wherein said sample is a biological
sample.

22. The method of claim 20, wherein said protein is immobilized on
a solid support.

23. The method of claim 22, wherein said protein is immobilized on
said solid support as part of an array.

24. The method of claim 22, wherein said solid support is a bead or
chip.

25. The method of claim 15, 16, or 20, wherein said compound is a
protein.

26. The method of claim 15, 16, or 20, wherein said fibronectin type
III domain is a mammalian fibronectin type III domain.

27. The method of claim 26, wherein said fibronectin type III
domain is a human fibronectin type III domain.

28. The method of claim 15, 16, or 20, wherein each of said proteins
comprises the tenth module of said fibronectin type III domain (¹⁰Fn3).

29. The method of claim 28, wherein each of said proteins contains
one, two, or three randomized loops and wherein at least one of said loops
contributes to the binding of said protein to said compound.

30. The method of claim 28, wherein said ¹⁰Fn3 lacks an
integrin-binding motif.

31. The method of claim 15, 16, or 20, wherein each of said proteins
is covalently bound to a nucleic acid.

32. The method of claim 31, wherein said nucleic acid encodes the
covalently bound protein.

33. The method of claim 32, wherein said nucleic acid is RNA.

34. The method of claim 15, 16, or 20, wherein said complex or said
compound is detected by radiography, fluorescence detection, mass
spectroscopy, or surface plasmon resonance.